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MICHAEL HOCHMAN 

Abstract. We show that any direction in the plane occurs as the unique non- 
expansive direction of a Z 2 action, answering a question of Boyle and Lind. In 
the case of rational directions, the subaction obtained is non-trivial. We also 
establish that a cellular automaton can have zero Lyapunov exponents and at 
the same time act sensitively; and more generally, for any positive real 9 there 
is a cellular automaton acting on an appropriate subshift with A+ = — A~ = 9. 



1. Introduction 

Consider a Z 2 action (X, T) on a compact metric space. Let i be a line in the 
plane and let l r denote the set of points within distance r of I. Then I is said to 
be an expansive line if there exist r > and S > such that, for all x,y G X, 

d(T u x, T u y) < 6 for all u e C n Z 2 => x = y 

Expansiveness only depends on the direction of the line and not the line itself, so 
we may speak of expansive and non-expansive directions. Note that if I contains an 
integer point u € then expansiveness of I is equivalent to expansiveness of the 

map T u : X — > X, but for I with irrational slope there is no such interpretation!*! 

Expansive and non-expansive directions were defined by Boyle and Lind in [3], 
where they were used as a tool in the study of the directional dynamics of an 
action. Many properties of the dynamics of subactions T u vary nicely withing 
connected components of the set of expansive directions. For example, within 
such a component the entropy of subactions varies linearly. Certain properties 
are constant within expansive components: for example, if T u acts as a shift of 
finite type then so does T v as long as the directions u, v are in the same expansive 
component. 

One of the basic questions that arose in [5] was to understand what sets can 
occur as the set of non-expansive directions. It was shown that this set is closed 
and, when the phase space is infinite, non-empty; and furthermore if C is any closed 
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set of directions of cardinality |C| > 2, then it is the set of non-expansive directions 
for some action. 

It has been an open problem for some time to determine which direction can 
occur as the unique non-expansive direction in a non-trivial way. If one begins with 
an expansive Z- action (X, T) and extends it formally to the 1? action [X, (T, idx}) 
generated by T and the identity map, then one obtains an action whose unique non- 
expansive direction is the vertical one; but the action in that direction is trivial. 
One can construct similarly trivial examples in which an arbitrary rational direction 
is the only non-expansive one. However, attempts have not succeeded in producing 
non-trivial examples for rational directions (a proposed example in [6] turned out 
to be flawed, see [2]), or any examples at all of actions with a single irrational 
non-expansive direction. 

In this paper we resolve this problem as follows: 

Theorem. For every direction £ in the plane there is an expansive 1? action whose 
unique non-expansive direction is I. In the case I has rational slope, the correspond- 
ing subaction is non-trivial, i.e. none of its elements act as the identity. 

Corollary 1.1. A set of directions occurs as the set of non-expansive directions 
for an expansive 1? -action if and only if it is closed and non-empty. 

This follows by combining the theorem with Boyle and Lind's result for sets of size 
> 2, but can also be derived directly from our construction by taking unions of the 
systems it provides. The Boyle-Lind examples are also unions of a similar sort. Thus 
the constructions we have are quite degenerate, in the sense that they decompose 
into subsystems with small sets of non-expansive directions. The following question 
is therefore natural: 

Problem 1.2. Can every nonempty closed set of directions occur as the non- 
expansive directions of Z 2 -action that is transitive/minimal/supports a global er- 
godic measure? 

Another consequence of our construction is: 

Proposition 1.3. There exists a cellular automaton f such that, for every t, there 
is a subshift X t on which f acts as an automorphism without equicontinuity points 
and such that the Lyapunov exponents are A+ =t,\~ = -t. 

This answers to a question of Bressaud and Tisseur [4] in the special case t = 0, 
showing that even for cellular automata whose action is sensitive to initial condi- 
tions, information can propagate unboundedly, but at a sublinear rate, i.e. with 
zero speed. 
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Theorem Q] and the last proposition are related as follows. Suppose we wish to 
realize a line I as the unique non-expansive direction of a Z 2 -action. We shall do so 
on a zero-dimensional phase space. In this case we may fix an expansive direction 
with rational slope, and choose another rational direction so that together the 
actions in these directions generate the full action (or a finite-index subgroup of 
it). Using expansiveness we may re-code and identify the first direction with the 
shift on some symbolic space X, and the second direction with an automorphism 
of X. Thus the problem has been reduced to one of constructing an appropriate 
shift space and an automorphism of it; this is the same setting as is studied in the 
theory of cellular automata. 

This reduction highlights an interesting aspect of the problem. Each automor- 
phism is given by a block code. There are only countably many of these, but there 
are uncountably many directions (or values for Lyapunov exponents) . Thus if one 
is to construct examples of automorphisms which realize any given direction as the 
unique non-expansive one for the generated action (or if one wants to construct 
CA with arbitrary Lyapunov exponents), then the direction (or exponents) must 
be encoded at least in part in the subshift rather than the automorphism. We 
shall make this encoding quite explicit, effectively designing the automorphism as 
an interpreter and using the subshift as a program controlling the action of the 
automorphism. 

Our strategy will be to construct an automorphism that, roughly speaking, per- 
forms a sequence of shifts on the underlying space at a rate that is encoded in 
the subshift it is acting on; this rate is what will determine the slope of the non- 
expansive direction. For example, taking the full shift as our space, a as the shift 
and the automorphism tp = a n , we see that in the generated Z 2 -action every direc- 
tion is expansive except the line ny + x = 0. Here ip shifts at a rate of n symbols 
per unit time. We would like to control this rate so as to make it an arbitrary real 
number 0. The implementation of this simple idea, however, is rather involved. Our 
solution relies on a property that has been called intrinsic universality in the cellu- 
lar automata literature (e.g. [JJ). This means that there are automorphisms which, 
when restricted to an appropriate subshift, can simulate any other automorphism 
up to a temporal and spacial rescaling. We shall use an infinite hierarchy of such 
automorphisms, each of which simulates the next, and such that each level in the 
hierarchy performs a shift on the underlying space at a fixed rate (this construction 
is somewhat reminiscent of Gacs' error-correcting automata [5]). The sum of these 
rates will determine the overall shift and the non-expansive direction, and we will 
control these rates by encoding them into the shift space. 

We shall mostly use standard definitions and notation, which can be found e.g. 
in [10]. We denote by a the shift map on symbol spaces, and for a point x € S z we 
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denote its coordinates by Xi. For a symbol a we write a n for the n-fold concatenation 
of a. An automorphism tp of a subshift Y is given by a block code, and we shall 
say that tp has range r if the block code acts on a r-neighborhood [— r, r] (this is 
also sometimes called the radius of tp or its window width). The notation Ojv(l) 
denotes a constant depending only on N. 

The rest of this paper is organized as follows. In the next section we intro- 
duce a sufficient condition for the action generated by a shift-automorphism and 
the shift to have a unique non-expansive direction. Sections 03 outlines the main 
construction, section |4] supplies further details of the implementation. Section [5] 
applies the construction to prove the main theorem. Section [6] discusses the rela- 
tion and applications to Lyapunov exponents. Finally, in section [7| we present an 
simpler, alternative construction of a system with a unique, rational non-expansive 
direction. 

Acknowledgment: I would like to thank Doug Lind for some very interesting 
discussions and for his permission to include the example in section CO This work 
was done in the fall of 2008 during the special semester on additive combinatorics 
and ergodic theory at MSRI, and I would like to thank the organizers and hosts for 
that stimulating event. 

2. Prediction shapes 

In this section we define a device that quantifies quantify propagation of uncer- 
tainty under iteration of an automorphism. This device is related to Shereshevsky's 
notion of Lyapunov exponents for cellular automata [8], although there are a num- 
ber of differences. First, we are interested in the propagation of uncertainty both 
forward and backward in time, although one can easily modify our definitions so 
that they are one sided, and apply to endomorphisms as well. More importantly 
we measure uncertainty by fixing a finite block, rather than a one-sided infinite ray 
(in Shershevsky's case one fixes a leaf of the stable or unstable foliations). We shall 
discuss the relation to Lyapunov exponents further in section [6l 

Definition 2.1. Let Y be a subshift and tp an automorphism of Y. A convex, open 
subset A C R 2 is called a "prediction shape for tp\ Y if (0, 1) x {0} C A and for every 
compact set Ao C A and for all large enough n, if y, z S Y satisfy 

y\[-n,n] = z \[-n,n] 

then 

(tp t y) i = (tp t z) i for all (i,t) £ nA n Z 2 
where uAq = {n ■ u : u £ Ao}. 
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From the definition it is clear that the increasing union of prediction shapes is a 
prediction shape. 

If tp, tp~ x have range r then the diamond shaped region with vertices at (—1, 0) 
and (1,0) and with sides of slope ±l/r is a prediction shape for tp\y. This bound 
derives from information about the block-code and one may sometimes get more 
from the block code, but in general the prediction shapes for tp\y depend non- 
trivially on Y . For example, if Y is a finite union of periodic orbits then for large 
enough n the restriction y\[- n , n ] determines y 6 Y; therefore R 2 is a prediction 
region. On the other hand, for infinite subshifts it is easy to see that no prediction 
shape contains [—1,1] x {0} in its interior. 

Our application of this notion is the following simple observation. 

Theorem 2.2. Let Y be a subshift with automorphism p. Let £ be a line through 
the origin distinct from the x-axis, and suppose that 

A = e 1 = {u e M 2 : d(u,£) < 1} 

is a prediction shape for tp\y ■ Then every direction except £ is an expansive direction 
for the 1? -system (Y, (a, tp)). 

Proof. Fix a line £' through the origin in a different direction from I. We must 
show that if r is large enough then y\(eiy determines y. For this it suffices to show 
that there is an r such that y\^y determines y\(£>)r+i. 

Since the slopes of £, £' are different, it is easy to see that there is an £ > and a 
compact set Ao C A containing the origin with the property that (£') 1+£ C (i'y+Ao. 
The desired conclusion now follows from the fact that A is a prediction shape. See 
figure EU □ 

Corollary 2.3. Under the assumptions of the theorem, if Y is infinite then £ is 
the unique non-expansive direction for (Y, (a, tp)). 

Proof. This follows from the theorem and the fact that every Z 2 -action on an infinite 
space must have at least one non-expansive direction [3]. □ 

3. Main construction 

In this section and the next we construct a subshift X over an appropriate 
alphabet, and define a pair of endomorphisms 7r and 7? of X by specifying block codes 
for them. This section describes their properties and outlines the construction. 
Some further details appear in the next section. 

By construction, X, ir and n will satisfy the following properties. First, the range 
of 7r and 7? will be 1, meaning that 7r(a;)o depends only on X-i,Xq,Xi and similarly 
for 7r. 



6 



MICHAEL HOCHMAN 




(£>y 



Figure 2.1. By shifting the center of the dark region Ao along 
the line (! ', one covers (£') 1+£ (the area swept out by the darkened 
corners of A ). 

Second, suppose we are given the following parameters: 

• An integer N. 

• A subshift Y C {l,...,N} z . 

• Block codes of range 1 defining inverse automorphisms p,^ 1 of Y . 

• An integer B > 1 ("Block length"), which is sufficiently large with respect 
to N, ip. 

• An integer W > 1 ("Wait time"). 

• An integer D ("Displacement"), which may be positive or negative, indicat- 
ing displacement to the right or left respectively. 

Then there is a subshift 

X' = X'(Y, N, ip, B, W,D)CX 

such that 
and an integer 

(3.1) r=(B + 0^(l))(l + W + |X>|) 

so that (X',ir) and (Y, ip) are related in the following manner. Each configuration 
of x G X' breaks into blocks of length B each representing one symbol from the 
alphabet {l,...,N} of Y, and thus x £ X' encodes a point y £ Y. With this 
interpretation of x, the endomorphism 7r T acts on x in the same manner that 
<t d o ip acts on Y, i.e. it applies tp to the encoded sequence y without altering the 
block structure, resulting in a new point x' £ X encoding ip(y), and then shifts 
each block of x' a distance of D blocks, i.e. D ■ B symbols, mimicking the action of 




(j D on <p{y). More precisely, there is an isomorphism 

(3.2) (X',<j,n) S(7x{0,...,B-l}x {0, . . . , T - 1}, a B , <pr) 

where ob , <^t are the suspension maps defined by 

<J B {y,b,t) = ( cr{ b = >y , 6+1 mod B , t ) 

<pr{y,b,t) = ( (a D ip)V=Vy , 6 , t + 1 mod T ) 

Here we have denoted {6 = 0} = f^and {t = 0} = <5to- See figure l3~Tl 

What we have required of n, n is very similar to what is called intrinsic univer- 
sality, which has been studied in the CA literature (e.g.jl]), although the additional 
shift by D blocks is special to our construction. Such endomorphisms have been 
constructed many times, as well as automorphisms [7|. However, we have not found 
a reference that satisfies all of our requirements exactly, and for this reason and 
in the interest of completeness we provide an outline of the construction details. 
In this section we give an overview; in the next we give some of the finer details. 
However, the properties above are all we shall use about tt, n and one may prefer 
at this point skip ahead to section \E\ where we prove theorem [TJ 

We construct ir, tt in a manner independent of the parameters above; tt, tt will 
operate as "interpreters", and the other parameters will be encoded in the config- 
urations of X' so as to influence the way in which tt acts. We shall eventually 
set X = UX', where the union ranges over all choices of parameters. Notice that 
although X is larger than UX' , nonetheless n = n^ 1 on X, because this is true on 
each X' (and ir, n are given by the same block code on all of them). 

For the construction we shall assume that the parameters Y, N, <p, B, W, D are 
given, and describe the block codes for it, tt in a way that is independent of the 
parameters, and a subshift X' that depends on them. 

The alphabet of A" consists of quadruples of symbols, which we denote (b,p, s, d): 
here 6 stands for Block structure, p for Program, s for State and d for Data. The 
projection of a sequence onto each of these coordinates are layers: for example the 
sequence of data components is the data layer. 
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The symbols used in the data layer will include the symbols 0, 1, which we call 
bits. We shall represent each symbol of Y by a sequence of [logiV] bits followed 
by an appropriate terminating symbol; together we call such a sequence a word. 

The Data layer consists of two words, representing symbols from Y, starting 
at the left side of the block; the remaining space to the right of these words is filled 
with "blank" symbols. The first word in the pair is interpreted as the symbol of 
Y currently represented by the block, and the second as the symbol represented 
by the block in the previous cycle. There will be times when the data will be in a 
corrupt state, but even then it will be possible to recover the uncorrupted current 
and previous states from the other layers; see below. 

The Block layer is a periodic sequence whose period is B and is not modified by 
7r. At this stage we may take the block layer to be a periodic concatenation of the 
string 10 s-1 , and we shall call such a sequence, and also the indices it occupies, 
simply a block (later on we will add more information to this layer; see section 
15. In the identification X' = Y x {0, . . . , B - 1} x {0, . . . , T - 1}, the second 
component in the image of x € X' will be determined by the residue class mod 
B of the position of l's in the block layer of x. We remark that since B can be 
arbitrarily large, X will contain also points whose block layer consists of all 0's, or 
of a single 1 surrounded by 0's; but as we shall see these will not cause a problem. 

The Program layer is also periodic with period B and is not modified by n. 
The repeated sequence, which we call simply the Program, is constant throughout 
X' . The program begins at the left side with an encoding of the block codes of tp 
and tp -1 . This takes the form of a sequence of 5-tuples of words, representing 5- 
tuples of symbols from Y. Each 5-tuple represents an input (3 F-symbols) and the 
corresponding output symbol of tp and tp -1 (recall that both are assumed to have 
range 1). We separate these 5-tuples from each other with some special symbol, 
and terminate the sequence of 5-tuples with another special symbol. Next, the 
program layer contains the parameters W, D encoded as contiguous sequences of 
l's, either W or D in number, followed by terminating symbols. The remainder of 
the program layer is filled with blanks. 

Finally, the State layer contains auxiliary information used to interpret the 
Program layer and use it to update the Data layer. We call a sequence of state 
symbols corresponding to a Block simply a State-block. The state-blocks in different 
blocks typically differ from each other, since they depend on the data in the block 
and the neighboring blocks, but they will be synchronized in the following sense: 
there is a special state-block called the Synchronized State, so that once (and 
only once) every T applications of it, the synchronized state-block appears in all 
the blocks of a configuration x € X' . We shall call an x £ X' with all blocks 
synchronized a synchronized configuration. It is during such a time that the data 
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Figure 3.2. The arrangement of information inside a block. 



layer is guaranteed to represent correctly the current and previous symbols. Thus 
in the identification X = Y x {0, . . . , B — 1} x {0, . . . , T — 1}, the third component 
of the image of x is the number t of applications of n^ 1 needed to bring x to a 
synchronized configurations, and the first component is the current symbol in the 
data layer of of 7r~*x. 

The layout of a block is depicted in figure 13.21 

Applying tt repeatedly to a synchronized configuration x E X 1 leads to the 
following sequence of events, which we call a Cycle (figure l3~3|) : 

(1) Each block "transmits" its current data word to the neighboring blocks on 
its left and right, and receives the same information from them. At the end 
of this stage, the state contains two words, aL,ap, representing the current 
y-symbol encoded in the neighboring blocks to the left and right of the 
current block, respectively. 

This stage takes B + On(1) applications of ir to complete (each bit moves 
a distance of B, taking B time steps, plus a constant amount of time needed 
too coordinate the transmission which depends only on the number of bits 
being transmitted, which depends on N). 

(2) The two words from the data layer - the current and previous F-symbols 
- are copied to the state layer and simultaneously deleted from the data 
layer. If ac is the current symbol and ap the previous symbol of the block, 
then the state space now contains the 4-tuple a^acapap of F-symbols. 

This stage takes Oat(1) applications of ir to complete. 



10 



MICHAEL HOCHMAN 



(3) We now enter a loop in the course of which the 4-tuple in the state layer 
is translated to the right, stopping opposite each 5-tuple in the program 
layer. 

(a) For each 5-tuple it checks if the triple of words a^acaR matches the 
input-triple in the program layer. 

(b) When a match is found the words a,L,aR, and ap in the state layer are 
erased, and the corresponding output word b = ipo(aLacciR), which is 
encoded in the data layer, is copied to the state layer. The state layer 
now contains the current y-symbol a c and future F-symbol b. 

(c) The comparisons continue also after a match is found. The implemen- 
tation will be such that each comparison takes the same number of 
steps, whether or not a match is found. A match will be found exactly 
once. 

Since the comparisons continue at the same rate after a match is made, the 
number of applications of n in this stage is independent of the configura- 
tion, and this stage ends at the same point in the cycle for all blocks in a 
configuration. The time for this step is On )V (1). 

(4) The current and future ^-symbols are translated back through the state 
layer to the left end of the block and transferred to the data layer: b is 
copied to the "current" slot and ac to the "previous" slot, and they are 
simultaneously deleted from the state layer. 

This step takes 0_/v lV (l) applications of ir to complete. 

(5) The data layer is shifted \D\ blocks to the left or right, according to the sign 
of D. For this, the sign is first determined, and then a loop is performed 
during each iteration of which the data layer is shifted by one block length. 

This step takes (5+0^(1)) \D\ applications of ir; the B term corresponds 
to actually transporting the data. The Ojv(l) term is the overhead required 
each cycle. We give further details in the next section. 

(6) The state layer "mock-shifts" the data layer W more times, meaning that 
the state goes through a cycle which takes the same amount of time as 
shifting it one block, but doesn't result in such a shift taking place. 

This step takes {B + On(1))W applications of tt. 

(7) All blocks return to the synchronized state. 

This takes Oat(1) applications of it. 

Part of a cycle is depicted in figure 13.31 

Given the parameters Y, ip etc., let X" Clbe the set of synchronized configura- 
tions whose program layer corresponds to the given parameters and whose sequence 
of symbols encoded in the data layer correspond to points in y € Y and (p~ 1 (y). 
We then set X' = uJSq^X", where T is the length of one cycle. 
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Figure 3.3. Part of a cycle, including simulation (but not the shifting). 
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It is clear from this description that 7r T simulates ip, where T is the length of a 
cycle. The isomorphism l|3.2jl is given by 

x i-> (y,b,t) 

where b is the least non-negative integer such that —6 is the coordinate of the 
beginning of a block; t is the number of applications of ir^ 1 to x required to bring 
x to a synchronized configuration; and y 6 Y is the sequence encoded by n~ t (x), 
where yo is the symbol coded in the block to which belongs in 7T~ t (x). The length 
of a cycle is given by (|3.1|l . 

Note that the alphabet of X', and therefore of X, is independent of the parame- 
ters, and in particular of N. Thus we can simulate systems (Y, a, <p) on arbitrarily 
large alphabets by subsystems X' C X . The upshot is that in order to do so the 
parameter B must be large enough that the blocks can encode the parameters as 
described above. 

The implementation details of it are very similar to those involved in construct- 
ing a universal Turing machine, and are completely standard, with one exception: 
generally Turing machines are not reversible, yet we want 7r to act invertibly on 
X 1 and for n = n^ 1 to have range 1. With care this can be done. Notice that the 
only place where information is deleted in the scheme above is in the stage (J3bj , 
where the 5-tuple in the state layer matches the 4-tuple in the program layer. At 
this point certain information is erased from and added to the state layer, both are 
present in the program layer; this allows the process to be reversed locally (this is 
the reason tp^ 1 is encoded in the program layer; notice that it is not used explicitly 
in the definition of n) . The other steps - namely, the transferring of bits from one 
place to another, etc. - can be done invertibly with inverse having range 1. Thus 
we have achieved our stated goal. The bound l|3.ip for T follows easily from the 
construction. 



4. Implementation details 

This section outlines the realization of the automorphism it described in the 
previous section. It is provided for completeness and readers may prefer to skip 
ahead to the next section where the main construction is undertaken. CA simulating 
other CA have been constructed a number of times in the literature, e.g. [I], 
and are similar to universal Turing machines. The only new ingredient here is a 
careful analysis of certain aspects of the time complexity of the simulation, and 
our emphasis on invertibility. Although both aspects have been addressed in the 
literature, it is easier to indicate the construction than to explain how to modify 
existing ones to meet our specific needs. 
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As a complete implementation of it would be a very lengthy undertaking, we de- 
scribe only the part of the implementation responsible for the first stage of the cycle, 
in which the current data words are transmitted between immediate neighboring 
blocks. In this stage we already encounter the main ideas needed to complete the 
rest, and we provide a few hints about the other stages. 

The symbols of the state layer represent sets populated by agents. An agent 
is a finite state automaton. Each agent in each state-layer cell will, with each 
application of it, perform one or more of the following operations: (1) modify the 
symbols in the data layer, (2) update its internal state, (3) move one cell left or 
right. By design, not more than one agent per cell will attempt to modify the data 
layer, so no conflict will arise. The nature of the operation that an agent performs is 
determined by the other contents of its cell prior to the operation, including its own 
previous internal state, the states of the other agents in its cell, and the symbols in 
the data, program and block layers. 

There are two types of agents: 

• The main agent. There is one such agent per block, and it is the only agent 
capable of modifying the data layer. 

• Data agents. Used to store and transport data. We allow several types of 
data agents, which play slightly different roles, but they overall behavior is 
the same. 

We shall also add new symbols to the block layer. We call these roadsigns. Their 
role is to signal some event to the agents at that cell. 

Note that, since the laphabet of X may not depend on the parameters N, Y, ip, B, D, W, 
we may introduce only a finite number of agents and new symbols (roadsigns) ; but 
we may arrange them as we wish. This allows us a great deal of flexibility in pro- 
gramming the agents and providing them external cues to modify their behavior. 

The data agents role is to store and transport a single bit of data. Their internal 
state consists of a motion symbol ("left", "right", or "stationary") ; and a data symbol 
("0", "1" or "empty"). At each step, a data agent updates its state based on roadsigns 
in the block layer and instructions from the main agent, if it occupies the same cell 
as the data agent. Then the data agent takes one step left or right or stays in its 
current cell, according to its motion symbol. 

The main agent acts as coordinator, issuing instructions to data agents and 
modifying the data layer. Its operation is more complex, and it is our goal to 
describe some of it in detail. 

At the beginning of each cycle all agents are arranged as follows. The main agent 
is located in the leftmost cell of the block in an initial state that we call Si. At 
each site where the data layer contains a data bit from the current word, there are 
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two data agents, one of each type Left and one of type Right (not to be confused 
with their motion state!), with motion symbol "stationary" and empty data. 

The main agent is initially in a state si. While in this state he moves one cell to 
the right at each time step, and at each cell has the following effect: the data bit 
from the data layer is copied to the data agents, and the data agents are "launched", 
i.e. their motion symbol is set as appropriate (the motion state of the L:eft agent 
is set to "left", and that of the other to "right") . The end of the current data word 
is indicated by a roadsign (special symbol in the block layer), and when reached it 
causes the main agent to enter internal state s 2 . 

At this point there are two new copies of the current data word, encoded in 
sequences of agents who are marching left and right at unit speed, forming what we 
shall call "caravans". We would like the main agent to meet the caravans arriving 
in its block and cause them to stop. Thus he should do "nothing" for a while, and 
arrive at a designated spot at a designated time to receive the first caravan. Since 
entering a state of inactivity and staying there is not an invertible operation we 
instead have the agent walk to the right while in state s 2 until it reaches a specially 
placed roadsign. At this point it enters state s 2 , walks left until it reaches another 
designated roadsign, and enters state S3. Since the transitions are controled by 
encounters with specific roadsigns they are invertible, and by by controlling the 
positions of the roadsigns we can determine the time and place at which the agent 
enters state S3. 

We assume that the main agent enters state S3 just as the first data agent in 
caravan of data agents from the block to the right is arriving at the cell where 
it is to stop. In state S3 the main agent walks to the right, and whenever it 
shares a left-moving data agent it sets its motion symbol to "stationary". Thus the 
main agent will cause the left-moving caravan to halt. A roadsign indicates to the 
main agent that it has reached the last data agent in the caravan (this position is 
completely determined by the parameters and the choice of roadsigns so far) ; when 
this roadsign is reached the main agent enters state S4. 

In state S4 the main agent behaves similarly, moving left and stopping data 
agents arriving from the right. When the last data agent is halted, a roadsign 
forces the main agent into a new state S5. 

At this point we have completed our goal: the state layer of each block contains 
data words from the current block and its two neighbors. The time elapsed from 
the beginning of the cycle is B + On(1); the B term is the time it actually takes 
each data bit to travel, and the Ojy(l) is an adjustment encompassing overhead 
and the fact that slightly less than B may have been traveled (we have some choice 
about where to halt the caravans). 
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With regard invertibility, note that given the state Si in which the main agent 
is found, the effect on data agents is invertible, and the transitions between Si to 
is controlled by roadsigns, and is invertible as well. 
We conclude this outline with some further comments. 

In our example we have not demontrated the deletion of bits from the data or 
state layers. To make this invertible each deleted bit must be present in some other 
form in the same cell. For example, when transferring the current data word to the 
state layer the main agent will delete a bit from the state layer while at the same 
time recording it in a data agent; thus this operation can be reversed. 

The next step in the cycle is to transfer the data-agents, representing three data- 
words, to the right, stopping opposite each corresponding triple in the program 
layer. This is similar to the transfer we just performed, except there is no main 
agent on the receiving end to halt the caravans. This is solved by positioning 
roadsigns that cause the data agents to reverse direction. These act as "reflecting 
walls", and allow the main agent who launched the caravan to also halt them (the 
order of bits in the data words is reversed; one can either work with this reversal, 
or perform the reflection twice). 

Another point that needs some care is the comparison stage, at which the three 
data words represented by the data agents are compared to the corresponding 
words in the program layer. When comparing the input triple in the program to 
the corresponding triple in the state layer, one cannot simply traverse them both 
from left to right, say, and take note if they differ at some point, because this is 
not reversible (when going backwards, you are in a state of knowing that there is 
a differing pair of symbols until you reach the leftmost such pair. After that your 
state is that of not yet having seen a difference. But there is no way to know, 
when you reach a differing pair, if it is the leftmost such pair or not). To overcome 
this, one begins on the left, say, and puts down markings in the state layer: green 
until the first difference, if there is one, and red thereafter. When the end of the 
comparison is reached we note the current color, and then go back, right to left, 
and erase the color markings. This procedure is invertible with range 1. 

Lastly, we discuss how to control the number of iterations of the "shifting" stage, 
which must occur D and W times. Perhaps the simplest is that, during the time 
that the data is being transferred, the main agent "counts down". This can be done 
in the simplest of ways, by coding D and W into the program layer as a sequence 
of D or W special symbols, respectively; transferring them to the state layer (as 
stationary data agents); and "crossing one out", i.e. resetting one of them, with 
each "shift" iteration. 
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5. Realizing unique non-expansive directions 

Before continuing, let us make some observations about the construction above. 
The following is clear from the construction: 

Lemma 5.1. Suppose Xi C X is constructed from the parameters Yi, <p, N, B, W, D. 
Suppose that Y 2 C Y\ is a- and ip-invariant, and let X 2 be constructed using pa- 
rameters Y 2 ,ip, N, B, W, D. Then X 2 < = X 1 . 

Lemma. tt\x = Mx) -1 

Proof. Recall that X = UX', the union being over all systems constructed from 
permissible parameters. The lemma follows from the fact that for each such X' we 
have tt\x' — (7r|x') -1 > and in all cases ir is given by the same block code. □ 

Next, we relate the prediction shapes of ip\y and tt\x>. 

Lemma 5.2. Suppose X' C X is constructed from the parameters Y, tp, N, B, W, D. 
Let A be a prediction shape for ip\y ■ Then A(A) is a prediction shape for ir\x' , where 
A : M 2 — > M. 2 is the linear map fixing e\ = (1, 0) T and mapping e 2 = (0, 1) T to the 
vector (D,T/B), or in matrix form, 

A=h D ' 

T/B 

Proof. This is immediate from the identification 

{X', a, tt) S (Y x {0, . . . , B - 1} x {0, . . . , T - 1}, a B , (fr) □ 

We now undertake the main construction of this section, and proceed to an- 
alyze it. Let N be the number of symbols in the alphabet of X. Fix a se- 
quence B n , D n , W n of parameters for the construction above, with Wk > 2 (hence 
T k /B k > 2). 

For a subshift Y C X invariant under tt, let Z n (Y) denote the system X' con- 
structed above with parameters N, Y, tt, B n , D n , W n . Define 

Z?+ k = Z n (Z n+1 (...(Z n+k (X))...)). 

An induction using lemma EH] shows that Z™ +k+1 C Z" +fc ; thus 

oo 

— | | 

k=l 

is non-empty and a- and tt- invariant. Finally, set 

Z = Z™. 

It is easy to see that we have the relation 

(5.1) Z = Z? = Z^Z?) = Z x {Z 2 {Zf)) = ... 
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etc. 

We will now show that there is a (necessarily unique) line I through the origin 
so that 

A = f 1 = {«el 2 : d(u,t) < 1} 

is a prediction shape of Z, and calculate the slope of I. It suffices to write A as an 
increasing union of prediction shapes for ir\Z. 

Let A denote the unit ball in M 2 with the norm || - 1| 1 , which is a prediction 
shape for ir and any ir- invariant subshift of X. Let A n : M. 2 — > M 2 denote the map 
associated as in lemma HT2l to Z n {-). Since Z^ = Z n {Z^ +1 ) and A is a prediction 
shape for Z^ +1 , it follows from lemma H21 that A n (A) is a prediction shape for 
Z!^. Therefore since Z^L X = Z ra _i(Z£°) the same lemma gives that J 4„_i(^4„A) is 
a prediction shape for Z^_ lt and iterating we have that 

A„ := A ± A 2 . . . A n {A) 

is a prediction shape for Z = Z^° . 

Notice that each the shape A„ is a quadrilateral having two vertices, (—1, 0) and 
(1,0), on the x-axis, one vertex above the x-axis, and one below it. We shall now 
analyze the asymptotic behavior of these vertices. Let A be the matrix in lemma 
i.e. it is of the form 

r 1 D 

T/B 

for integer parameters D,T 7 B and T/B. Let {x,y) T E IR 2 with y > and 
(x',y') T = A{x,y). Then 

x' _ DB B x 
V' T T y 

Now fix n and let (x n ,y n ) T denote the vertex of the quadrilateral A„ that lies 
in the upper half plane. Then 

(x„,y„) T =Ai...An(0,l) 

so x n /y n is given by 

x n DiBi , Si [D 2 B 2 , B 2 ( [D n B n , B n 



A 



y n Ti Ti \ T 2 T 2 \ \ T n T n 1 

or, using p.ip and writing 

DkB k D k n B k 

ak = — = (1 + £fc) | „ i ; T „ and 



T k v '\D k \ + W k T fc (l + e k )(\D k \+W k ) 

where e k = On, v (^), we have 



x 



oi + /3i(a 2 + P 2 (. ..{a n + p n - 0)) . . .) 

Vn 
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Since Tk/Bk > 2 for all k by our choice of parameters, (3k < 1/2, so this sequence 
converges to some A 6 (—1,1). For the same reason we have y n > 2™ and thus for 
large enough n we have x n — > oo. Since the slopes of the two sides of A„ which lie 
in the upper half plane are 

Un t Vn 

and 



x n 1 x n -\- 1 

it follows that that these slopes converge to the common value 1/A. 

A similar calculation shows that as n — > oo the remaining vertex of A„ grows 
unboundedly in norm and that the slope of the remaining two sides converges to 
1/A as well. 

Finally, it follows that there is an increasing subsequence of the A^'s whose union 
is (necessarily) the set A = i 1 , where £ is the line with slope 1/A. 
In summary, we have: 

Theorem 5.3. (Z, (a, 7i")) has a unique non-expansive direction whose slope is 1/A, 
where 

A = oli + f3i(a 2 + (3 2 (a 3 + ... (a n + (3 n {. . .))) . . .) 
and otk, [3k,£k are as given above. 

Proof. Theorem [221 implies that any line with slope different from 1/A is expansive 
(and the horizontal direction is as well, since a acts expansively by definition). That 
the line with slope 1/A is a non-expansive direction then follows from the fact that 
any infinite system has non-expansive directions, and the following claim. □ 

Claim 5.4. For any choice of parameters, the system Z is infinite. 



Proof. From the representation (|3.2|) . we see that every choice of z 2 & Z%° is rep- 
resented, modulo cr-shifts, by T±B\ points Z\ = z\{z2) G Z. Thus \Z\ = \Z^°\ > 
T x B x \Zf\. Similarly, |Z 2 °°| > T 2 B 2 \Zf\, so \Z\ > T x B x T 2 B 2 \Zf\; and so one. 
Since T n B n > 2 for each n, the conclusion follows. □ 

We have shown that out construction yields systems with a unique non-expansive 
direction of the form 1/A. It remains to show that any direction can be attained. 
The following an elementary exercise in representing reals: 

Lemma 5.5. For any real number \6\ < 1 occurs as the number A for some choice 
of the parameters Bk, Wk, Dk, and we may choose Wk > 2. 

Proof. We assume for convenience that 9 > 0; the cast 9 < follows similarly, 
the only difference being that all the D's are then negative and the endpoints of 
segments must appear in the reverse order. 

For some pair of integers W > 1 and D > 0, the number lies between \ D \+ W+1 
and ■ Taking W\ =W,D\=D and taking B\ to be sufficiently large, we 
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can make the error arbitrarily small and obtain 

6 G (a\, ot\ + (3i) 

One proceeds inductively to choose B^W^D^ so that 

9 - oi\ . 

G (a 2 ,a 2 + P2) 



implying that 

9 G (Oil + /3l«2, CCl + /?l(«2 + /?2)) 

and so on (note that the closure of each interval is in the interior of the previous 
one) . □ 

Theorem 15.31 and lemma IBTBI show that any line with slope 9, \9\ > 1, occurs 
as the unique non-expansive direction of some action. All other directions can be 
attained from this result by re-parametrizing the acting group. This completes the 
proof of the main part of theorem [TJ 

It remains only to show that in the case of a rational direction the action in that 
direction is non-trivial. This is shown in the same way as the proof of claim I5.4[ 
that Z is infinite; we omit the details. 

6. LYAPUNOV EXPONENTS 

Given a subshift Y and an endomorphism ip : Y — > Y, Shereshevsky [8] defined 
the Lyapunov exponents A + , A~ as follows. For y G Y define 

= min{n : Vz G Y VO < s < t (z|[_ n ,oo) = y\[-n,ao) => (<p" ^)|[o,oo) = (v s 2/)|[o,oo))} 
and similarly 

Itiv) = min{n : Vz G Y VO < s < t (2: | = J/|(-oo,«] =^ (v'^k-oo.o] = (<^*y)l(-oo,o])} 
Set 

= max max if (a l y) 
The Lyapunov exponents are then defined by 

A± 

A ± = liminf-^- 

t— too t 

(Shereshevsky's original definition of A differs from the above but is equivalent by 
®)- 

We omit the proof of the following, which is an immediate consequence of the 
definitions: 

Proposition 6.1. <p : Y — ► Y be an automorphism of an infinite subshift Y and 
let A be a prediction shape for ip\y ■ Let 9 + ,9~ denote the (possibly infinite) slopes 
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of the right- and left-tangent rays to OA at (—1,0) and (1,0), respectively. Then 
X+ < 1/9+ and \- < -1/9- . 

Corollary 6.2. // the strip A = {(x,y) : \x\ < 1} is a prediction shape for ip\y , 
then X+ =\- = 0. 

Tisseur [9] and later Tisseur and Bressaud [4] studied the relation between Lya- 
punov exponents, particularly the case of zero Lyapunov exponent, and the exis- 
tence of equicontinuity points for the action of p. Let us recall some definitions. 
For an endomorphism p acting on a subshift Y C £ z , we say that a finite word 
a G S™ is a blocking word if, for any pair y, z G Y with y|[i, n ] = z|[i, n ] = a 
and y|[i : oo) = z|[i,oo)> we also have [i ;OC ) = (p*z)l[i,oo) for all t; and also for 

any y,z satisfying y|[_„_i] = z|[- n ,-i] = a and y|(_oo,-i] = z|(-oo,-i] we have 
((/?*2/)|(_ 00i „i] = (^z)^-^,-!]. The condition that tp have equicontinuity points is 
equivalent to p having a blocking word. Also, not having equicontinuity points is 
equivalent to p acting on Y with sensitive dependence on initial conditions. 

Returning to the matter at hand, Bressaud and Tisseur conjectured that when <p 
is a cellular automaton acting sensitively (i.e. without equicontinuity points) on Y, 
some point y G Y has liminf^oo I t + (y) > or liminft^oo If{y) > (01 Conjecture 
3]). Our construction provides a counterexample for this conjecture, as we describe 
next. We continue to use the notation introduced in the previous section during 
the construction of Z. 

Proposition 6.3. The action of n on Z does not have equicontinuity points. 

Proof. Suppose that x were an equicontinuity point. Then there is an n so that if 
x(i) = y(i) for \i\ < n and y G Z then 7r*x(0) = 7r*y(0) for all t. We show this is 
impossible. 

Consider the block a = x\[-i^y This block contains certain information about 
the configuration of the one or two blocks in Z which intersect the coordinates 
[— n, n]. This in turn may encode some information about the location and state 
of the one or two blocks in whose state are encoded in the two blocks in the 
first level of Z; and so on. But since a is a finite block, there must be a k so 
that a contains no information about the blocks whose encoding it intersects in 
Z^ 3 . Now note that we can choose y so that the state of these Z£°-blocks in y 
differs from their state in x. Thus there is some bit in these Zj^-blocks that differs. 
This bit will eventually be transported across a; therefore there will be a t so that 
7r*y(0) ^ 7r*a;(0). □ 

This last proposition may seem surprising, since the dynamics of (Z, it) appear 
at first glance to be almost periodic. However, it is not in reality so. 
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7. AN ALTERNATIVE CONSTRUCTION IN THE RATIONAL CASE 

We present here a construction that arose in discussions with Doug Lind and 
provides a simpler example of a system whose unique non-expansive direction is the 
vertical axis, but no power of the action in that direction is the identity (though in 
other ways the action is dynamically rather trivial). This example is significantly 
simpler than the one above and may be adapted to give examples in other rational 
directions, but we have been unable to get any irrational direction with this method. 
It is striking to us that the irrational case is so much more difficult than the rational 
one, and it would be interesting if a simpler construction for that case were found. 

As before, we construct a subshift X C E z and an automorphism it : X — > X 
such that the vertical strip of width 2 around the y-axis is a prediction shape for 

7T. 

Fix a parameter n E N. The alphabet E consists of the symbols 

(blank) 

— <— (arrows) 

k k 

[, ] for < k < n (brackets, with counter k) 

k k 

[, ] for < k < n — 1 (marked brackets, with counter k) 

* * 

Each configuration in X will have at most one arrow symbol in it; the rest will be 
blanks and brackets. Adjacent brackets will not be allowed, instead between any 
pair of brackets there will always be at least one blank or arrow symbol. 

We shall later describe the configurations of X in more detail, but first we define 
the automorphism it by giving the relevant transitions, from which a range-2 block 
code may be derived. The transitions are 

(7.1) — > — becomes ► 

n n—X 

(7.2) — ► [— becomes — [ — > 

k k-l 

(7.3) — > ] - becomes <— ] - if k > 

n 

(7.4) — > ]— becomes — ] — > 

k fe-l 

(7.5) - [ <— becomse - [ — > if k > 

n 

(7.6) — [ «— becomes — [— > 

o 

together with the symmetric rules obtained by reversing left and right, e.g. — [ <— 

n 

becomes <— [— (reversal of (|7.4p ). 
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To interpret this, one may imagine that the arrow represents an agent walking in 
a landscape of brackets, proceeding in the direction the arrow points to (rule (|7.1|0 . 
The behavior of the agent when it encounters a bracket depends on the orientation 
and the data on the bracket. 

• When the agent approaches a bracket from the "inside", 

— If the bracket has positive counter, the agent decrements the counter 
by 1 and turns around (rule l|7.3 p .(|7,5 p l. 

— If the counter is then it is reset to n; next, if the bracket was marked 
the agent removes the mark and turns around, but if it was unmarked 
the agent passes through (rule l|7.4|l . (|7.6p ). 

• When the agent approaches a bracket from the "outside" it marks the 
bracket, decrements the counter, and passes through (rule I7.2j l. We shall 
arrange that such an encounter only occurs when the counter is n and the 
bracket is unmarked. 

n n 

For example, starting from the pattern —>■[ — ] — the agent will "enter" the region 
between the brackets, reverse its direction 2n times, and emerge from the right side. 
Note that upon its exit it leaves behind the configuration as he found it, i.e. the 

n n 

final pattern is — [ — ] — 

We next describe the allowable arrangement of brackets in X. We first define 
special sequences of "plain" brackets [, ] which are arranged in a hierarchical manner. 
Begin by choosing a periodic subset h C Z of period 2 (there are two ways to do 
this), and set the symbols y^i £ I to be alternately [ and ] (this can again be done 
in two ways). Half of the symbols in Z \ I\ are now trapped between matching 
brackets; let I' 2 denote the half which is not, which is a coset of 4Z. Next, choose a 
subset h Q I' 2 of relative period 2 (a coset of 8Z; there are two choices) and define 
£ h to be alternately [ and ] (again two choices). Let I' z C Z\ (I\ U/2) be those 
indices which have not yet been determined, and which are not trapped between 
matching brackets, and choose I3 C I' z a subset of relative period 2 (a coset of 32Z). 
Proceed in this manner to define m for i E I3 and I^h, etc. After carrying this out 
for all n every i S Z, with possibly one exception, is trapped between some pair of 
brackets; the remaining point, if it exists, may be left blank or given the symbol [ or 
] . Three steps in the construction of such a y appears below (big brackets indicate 
the addition at each stage). 
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We define a pre-block to be a subword of an hierarchical arrangement as above, 
which consists of a matched pair of brackets and the region between them. The 
pattern [— ] is a pre-block, and we call it the level-0 pre-block; next is [[— ] — [— ]], 
a level-1 pre-block; in general, a pre-block containing level-n pre-blocks but no 
level- (n + 1) pre-block is a level- (n + 1) pre-block. 

Next, define a level-n block to be the word obtained from a level-n pre-block by 
inserting a blank in between every pair of symbols. 

We now define the admissible words in X. Let a be a block, and consider the 
patterns — > a— and —a <— , which we extend with blanks in both directions (but 
we suppress these blanks notationally) . It is easy to verify that after finitely many 
iterations of ir we get the patterns the —a — » and <— a—, respectively. Let L(a) 
denote the set of intermediate patterns obtained in this way. We define X to be the 
subshift such that every finite word in X appears as a subword of some b e L(a), 
for some block a. 

It is not hard to check that for each x € X there is a coset of 2Z on which there 
appears a hierarchical configuration of brackets in the sense above, with brackets 
now carrying counters and markings. On the complementary coset there appear 
only blanks and possibly an arrow. One can also verify that if x 6 X contains a 
block that does not contain an arrow, then that block consists of unmarked brackets 
with counters equal to n. 

The point of the construction is the following. The changes that occur in a 
configuration under it occur only at the site of an arrow or adjacent to an arrow, so 
in order to understand the propagation of perturbations to a configuration under 
7r we must understand is the rate at which the arrow moves. For this, note that in 
order to "pass through" the block 

n n 
«1 = [ ] 

requires 6n steps. Now consider the block 

n n n n n n 

02 = [ - [ ] [ ]-] 

To pass through this requires the arrow to go back and forth 2n times between the 
external brackets; each time it must cross the inner two brackets twice, taking 6n 
steps each time. Thus to cross a 2 requires 2n ■ (5 + 2 • 6n). 

Continuing in this way, one may show that the time to cross the level-n block 
a,k is (c„) fe , where c„ — > oc with n. On the other hand, the width of is d k for 
a constant d independent of n, and the blocks appear periodically with period 
d k in any configuration of X. It follows that in order to travel a distance of Nd k 
will require time on the order of N{c n ) k , i.e. over large scales the rate of travel 
is logarithmic (we remark that this is the slowest possible rate; if the rate were 
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sub-logarithmic we would have, counting configurations, that the action of it were 
periodic). 

In particular, if we know the configuration x|[_jv.jv] for x € X we can predict 
x \[-N+iogN,N-U>gN] up to time O(N). It follows that the vertical strip of width 2 
is a prediction shape for X, as desired. We omit the details. 

Finally, since the arrow does travel arbitrarily far in some configurations (in fact, 
in any configuration containing the arrow), it follows that the action of ir is not 
periodic. We remark, however, that the dynamics of ir are in other ways rather 
trivial, e.g. all invariant measures are concentrated on fixed points, and there are 
uncountable many of these (the configurations without an arrow) . 
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